Lateral Inhibition Overcomes Limits of Temporal Difference Learning
نویسندگان
چکیده
There is growing support for Temporal Difference (TD) Learning as a formal account of the role of the midbrain dopamine system and the basal ganglia in learning from reinforcement. This account is challenged, however, by the fact that realistic implementations of TD Learning have been shown to fail on some fairly simple learning tasks — tasks well within the capabilities of humans and non-human animals. We hypothesize that such failures do not arise from natural learning systems because of the ubiquitous appearance of lateral inhibition in the cortex, producing sparse conjunctive internal representations that support the learning of predictions of future reward. We provide support for this conjecture through computational simulations that compare TD Learning systems with and without lateral inhibition, demonstrating the benefits of sparse conjunctive codes for reinforcement learning.
منابع مشابه
Ictal and Interictal Electroencephalography of Mesial and Lateral Temporal Lobe Epilepsy; A Comparative Study
Background: Epilepsy is considered as one of the most important disorders in neurology. Temporal lobe epilepsy is a form of epilepsy including two main types of mesial and lateral (neocortex). Objectives: Determination and comparison of electroencephalogram (EEG) pattern in the ictal and interictal phases of mesial and lateral temporal lobe epilepsy. Materials and Methods: This cross-sectiona...
متن کاملControl of Multivariable Systems Based on Emotional Temporal Difference Learning Controller
One of the most important issues that we face in controlling delayed systems and non-minimum phase systems is to fulfill objective orientations simultaneously and in the best way possible. In this paper proposing a new method, an objective orientation is presented for controlling multi-objective systems. The principles of this method is based an emotional temporal difference learning, and has a...
متن کاملA Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures
VisNet2 is a model to investigate some aspects of invariant visual object recognition in the primate visual system. It is a four-layer feedforward network with convergence to each part of a layer from a small region of the preceding layer, with competition between the neurons within a layer and with a trace learning rule to help it learn transform invariance. The trace rule is a modified Hebbia...
متن کاملKernel-Based Reinforcement Learning in Average-Cost Problems: An Application to Optimal Portfolio Choice
Peter Glynn EESOR Stanford University Stanford, CA 94305-4023 Many approaches to reinforcement learning combine neural networks or other parametric function approximators with a form of temporal-difference learning to estimate the value function of a Markov Decision Process. A significant disadvantage of those procedures is that the resulting learning algorithms are frequently unstable. In this...
متن کاملProneural enhancement by Notch overcomes Suppressor-of-Hairless repressor function in the developing Drosophila eye
BACKGROUND The receptor protein Notch plays a conserved role in restricting neural-fate specification during lateral inhibition. Lateral inhibition requires the Notch intracellular domain to coactivate Su(H)-mediated transcription of the Enhancer-of-split Complex. During Drosophila eye development, Notch plays an additional role in promoting neural fate independently of Su(H) and E(spl)-C, and ...
متن کامل